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We present a study of the prospects for the measurement of TeV-scale light-flavored 
right-squark masses and the corresponding production cross section at a 3 TeV e + e~ 
collider based on CLIC technology. The analysis, performed in the framework of the 
CLIC Conceptual Design Report, is based on full Geant4 simulations of the CLICJLD 
detector concept, including standard model physics background and machine related 
hadronic background from two-photon processes. The events are reconstructed using 
particle flow event reconstruction, and the mass is obtained from a template fit built 
from generator-level simulations with smearing to parametrize the detector response. 
For an integrated luminosity of 2 ab _1 , a statistical precision of 5.9 GeV, corresponding 
to 0.52%, is obtained for unseparated first and second generation right squarks. For 
the combined cross section, a precision of 0.07 fb, corresponding to 5%, is obtained. 

1 Introduction 

Future high energy e + e~ colliders are precision tools for the discovery and the spectroscopy 
of new particles expected beyond the Standard Model. One attractive extension of the 
Standard Model is supersymmetry, which predicts a rich spectrum of new particles, one 
superpartner for each standard model particle. These new particles are expected to have 
masses in the range from about 100 GeV to a few TeV, and are thus coming within reach 
of modern colliders. In the next years, the LHC is expected to provide a decisive answer- 
on the question of the existence of TeV-scale supersymmetry. In typical scenarios, the 
superpartners of the light quarks (u, d, s, c) are among the heaviest superparticles, requiring 
energies in excess of 1 TeV for pair production. Early LHC results have already placed 
stringent limits on squark masses, reaching up to about 1 TeV in constrained models [l][2]- 
The study of these particles at an e + e~ linear collider requires a multi-TeV machine such 
as the proposed Compact Linear Collider CLIC [3J. 

Here, we consider the production and decay of light-flavored right-squarks in a R-parity 
conserving SUSY mSUGRA model [4j where the gluino is heavier than the squarks. In such 
a scenario, the right-squarks decay essentially exclusively into the lightest neutralino and 
their standard model partner, resulting in an event signature of two energetic jets and large 
missing energy, 

e + e~ q R q R -> qqXiXv 

The squark masses in the considered model are 1.116 TeV and 1.126 TeV for the first and 
second generation down-type and up-type squarks, respectively. The combined production 
cross section, taking into account the CLIC beam energy spectrum at 3 TeV, is 1.47 fb, with 
a ratio of 4:1 for up- compared to down- type particles. The mass of the lightest neutralino 
in this scenario is 0.328 TeV. 

In the framework of the CLIC Conceptual Design Report IE] , this process was studied as 
a benchmark to evaluate the physics performance for generic new physics signatures with 
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high-energy jets, missing energy and the rejection of high-cross-section Standard Model 
backgrounds. 

2 Experimental conditions and event reconstruction at CLIC 

The experimental conditions at CLIC, summarized in detail in [5], are characterized by 
high levels of coherent and incoherent e + e~ pair background as well as mini-jet background 
originating from 77 — > hadrons processes, due to the high luminosity and high collision 
energy. Together with the high bunch crossing frequency of 2 GHz, the latter leads to 
pile-up of hadronic energy deposits in the detector, in particular in the low-angle regions. 
This places strict requirements on the event reconstruction and the timing capabilities of all 
detector components. 

The events are reconstructed using the PandoraPFA particle flow algorithm 6|, which 
determines the timing of each reconstructed particle, allowing the rejection of out-of-time 
contributions most likely to be background. In addition to the event reconstruction itself, 
jet finding contributes to the elimination of 77 — > hadrons background. Several jet finders 
have been evaluated in the course of this study. The best performance was achieved with an 
exclusive k t algorithm u\ for hadron colliders, clustering the event into exactly two jets. In 
this algorithm, the two-particle distance is given by the pseudorapidity r\ and the azimuthal 
angle <f>, resulting in increased distances in the forward region, reducing the sensitivity to 
background particles. Different jet size parameters, which govern the amount of energy 
rejected by assigning it to beam jets, were studied, with the best performance observed for 
R = 0.7. 

3 Squark mass measurement technique 

The distribution of jet energies allows in princi- 
ple the simultaneous measurement of the squark 
mass and of the mass of the lightest neutralino 
from the upper and lower edge of the jet en- 
ergy distribution. However, this distribution, in 
particular the lower edge given by low-energetic 
jets, suffers significantly from standard model 
background, making precision measurements dif- 
ficult. 

Instead, it is assumed that the mass of the 
lightest neutralino will be measured with satis- 
factory precision in processes with higher cross 
sections and less background sensitivity, such as 
slepton production and decay [§] . With this ad- 
ditional knowledge, the extraction of the squark 
mass from distributions with a single kinematic 
edge becomes possible. 

Since the distribution of the center of mass 
energy at a 3 TeV CLIC collider has a substantial 
tail towards lower energies due to beamstrahlung, with only 34% of the luminosity in the 
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Figure 1: The Mc distribution including 
effects of the beam energy spectrum. 
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top 1% of the energy, methods which do not rely on the knowledge of the precise center of 
mass energy are advantageous (9l. 



One such technique is the variable Mc 10 , which uses the momenta of the two observed 



jets to form a modified invariant mass which is invariant under contra-linear boosts of equal 
magnitude of the two squarks, and thus independent of the center of mass energy. Mc is 
given by 



M c = JiE^ + E^-ip^i-p^f (1) 



= ^2{E 1 E 2 +p 1 -p 2 ), (2) 

where E\ 1 p\ and E%, p 2 are the energies and three momenta of the two visible final- 
state quarks (jets), respectively. The distribution of Mc, taken from generator-level events 
without background, but with the CLIC beam energy spectrum and with jet finding applied, 
is shown in Figure [l] It is bounded from above by 

2 2 

m% — mz 

M max = __9 X_ (3) 

rriq 

providing direct sensitivity to the squark mass. 

The construction of Mc assumes that the center of mass system of the collision is at 
rest in the detector system, which evidently is not the case at CLIC due to beamstrahlung 
and initial state radiation. Still, the boost of the collision system with respect to the labo- 
ratory frame is typically quite small, making it advantageous to use the complete available 
information, and not just transverse observables, as would be done at hadron colliders. The 
beam energy spectrum leads to a distortion of the edge of the Mc distribution, as discussed 
111 detail in (9). The best precision on the squark mass can thus be obtained from template 
fits which take the effect of the beam energy spectrum of CLIC into account. 

4 Simulation and signal selection 

To study the physics performance of CLIC for squark production, signal and background 
samples have been produced with the WHIZARD event generator [iT] with fragmentation 
and hadronization provided by PYTHIA [12], a nd have been fully simulated with a detailed 
GEANT 4 [13] model of the CLICJLD |5||14| detector. This detector model is based on 



the ILD 15 detector model for ILC, with some CLIC-specific modifications which account 
for the higher energy and the different background conditions. Before reconstruction, all 
events are overlayed with 77 — ¥ hadrons background corresponding to 60 bunch crossings 
(30 ns). In the event reconstruction, realistic timing cuts are applied inside the particle 
flow algorithm to reduce the impact of background. Finally, the events are clustered into 
two jets as discussed above. The present analysis is based on an integrated luminosity of 2 
ab _1 , with additional data sets produced for the training of multivariate analysis techniques 
discussed below. 

In addition to the reduction of 77 — > hadrons background, the rejection on non-squark 
physics background is one of the main challenges of the present analysis. Since the signal 
signature of two jets and missing energy is rather generic, high cross section standard model 
processes contribute to the background. The study of various background channels has 
shown that the standard model contributions which are hardest to reject are those with 
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genuine missing energy from neutrinos in the final state. The three major background 
channels are ttvv, qqvv and qqev. All three have cross sections which are 2 to 3 orders of 
magnitude above the cross section of light-flavor right-squark production. 




Figure 2: Left: Mq distribution stacked with all considered backgrounds after a cut on 
missing p t > 600 GeV. Right: Signal and combined background distribution after cut on the 
output of the Boosted Decision Tree. The fit is used as a parametrization of the background 
distribution to allow a background subtraction for the mass determination. 



A high signal purity, in particular in the region of the kinematic edge of the distribution, is 
crucial to obtain a precise mass measurement. This requires a reduction of the background 
by more than a factor of 1000. A cut on missing transverse moment can provide quite 
effective background rejection at the first stage of the analysis. Requiring a measured missing 
Pt > 600 GeV reduces the dominating background channels to approximately 10 -2 , in the 
case of the ttvv final state even to 2 x 10 -3 of their original cross section, while reducing 
the signal to 0.485. 

This reduction, however, is insufficient to provide a clean separation of signal and back- 
ground, as shown in Figure [2] left, where background processes still dominate the signal 
by more than an order of magnitude. Further reduction of standard model background is 
achieved with a boosted decision tree (BDT), as implemented in the TMVA toolkit [16] . 
It provides a separation of signal and background using event shape information, particle 
multiplicities and energy distributions as well as other discriminating variables. The BDT 
is trained with signal and background samples not used in the analysis. With the BDT 
selection, a clean identification of the signal is achieved, demonstrated in Figure [2] right. 
The overall signal efficiency is 36.1%, with a signal significance of S/y/S + B — 25.7. The 
background rejection procedure leaves the upper edge of the Mq distribution unchanged, 
providing the basis for precision mass measurements. The shape in the low Mq region is 
significantly altered, primarily due to the missing p t cut, which precludes low Mc values. 
Since this region of the distribution does not provide sensitivity to the squark mass, this 
does not affect the measurement. 

For the determination of the squark mass using a template fit, the remaining background 
is subtracted from the data distribution using a simple parametrization of the background 
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shape determined from a statistically independent background sample. 



5 Results 



500 



i , , | , , , , | , r 

Closest Template 

Measurement (BG subtr.) 

Template m = 1130.9 GeV 




1000 



1500 

M c [GeV] 



250 
200 



1 i ■ 1 1 1 i 1 1 1 1 i 1 1 1 ■ i 

X 2 vs template mass 

— a — X 2 result 




1050 1100 1150 1200 1250 



m, 



squark 



[GeV] 



Figure 3: Left: The Mq distribution for the measurement (background subtracted), com- 
pared to the template with the lowest x 2 - Right: % 2 between measurement and template as 
a function of the squark mass in the templates. The distribution is fitted with a bifurcated 
parabola, with the minimum giving the measured squark mass. 



It is possible to extract the mass of the squarks using the upper edge M™ ax of the 
Mc distribution. For a reliable fit of the edge, the detector resolution, distortions due to 
the beam energy spectrum and the influence of machine-related backgrounds need to be 
accounted for in the fit function. Still, even small background contributions in the region of 
the edge can have a significant influence, resulting in biased results. 

It thus seems advantageous to use a template fit instead, which allows the inclusion of 
the above-mentioned effects in the generation of the templates. In such a fit, the mass 
is determined by comparing the observed distribution with high-statistics signal templates 
generated for various different squark masses. An additional advantage of a template fit is 
that the complete Mc distribution enters into the fit, not just the high- Mc edge (although 
the edge also is the driving region in a template fit), potentially leading to reduced statistical 
errors for statistically limited samples and resulting in higher stability against remaining 
background contributions and statistical fluctuations. 

In the model used for this analysis the up- type right-squarks (ur,cr) with mass mj 
are about 10 GeV heavier than their down-type counterparts (da, sr) with mass mg. Since 
the present analysis is unable to distinguish between up and down type squarks, the mass 
measurement will give the result of the mean m squar k, weighted with the respective pro- 
duction cross sections. Since up-type squark production has an approximately four times 
higher cross section than down-type squark production, the mass value is dominated by the 
up- type squarks. 

For the template generation, a similar mass splitting between up- and down-type squarks 
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of exactly 10 GeV was used. The templates were created in steps of 3 GeV ranging from 
1050 GeV to 1248 GeV. 

In order to minimize statistical fluctuations in the templates, each of these mass points 
was generated with 50 000 events, corresponding to an integrated luminosity of C — 33.6ab _1 
at the true squark mass. Due to computational limitations it was not possible to perform a 
full simulation and reconstruction of these 3.3 million events. Instead, detector effects were 
included on generator level. 

As a first step, acceptance was taken into account by rejecting particles with | cos#| > 
0.995 or p < 100 MeV. Then, jet clustering was performed using the same algorithm as 
used in the rest of this analysis. To account for detector resolution effects, the reconstructed 
jet energies are then smeared with a Gaussian, assuming a jet energy resolution of 4.5%. 
This resolution, as well as a small off-set in Mq originating from the presence of 77 — ¥ 
hadrons background which is not included in the generator- level templates, is determined 
by comparing the Mq distribution for a template generated with the true mass values with 
a fully simulated high-statistics signal sample. The generated events are passed through 
the event selection procedure, including missing momentum requirements and the boosted 
decision tree, to fully reproduce the effects of the analysis procedure on the distribution. 

The template fit itself is performed by comparing the Mq distributions of different tem- 
plates and the background-subtracted measurement using a binned \ 2 with a free overall 
normalization of the template. The template with the lowest \ 2 is shown in Figure [3]fe/f, 
compared to the data distribution. From the dependence of the x 2 on t ne template mass, 
shown in Figure [3]rz^ft,£, the measured squark mass is determined through a fit with a bifur- 
cated parabola. The statistical errors are determined with a toy MC, resulting in 

m squark = 1127.9 GeV ± 5.9 GeV (stat) , 

in good agreement with the cross-section averaged generator value of 1123.7 GeV. The 
statistical error for an integrated luminosity of 2 ab -1 corresponds to 0.52%. The error 
on the neutralino mass, taken from other measurements [8] also enters into the present 
measurement. For the particle masses considered here, a 1 GeV uncertainty on the neutralino 
mass translates into a 0.54 GeV uncertainty on the squark mass, resulting in an error of 
1.8 GeV for the precision expected from slcpton measurements with an integrated luminosity 
of 2 ah" 1 . 

Using the selection efficiency obtained from the training phase of the boosted decision 
trees, the cross section was determined from the integral of the background-subtracted Mc 
distribution. Here, a statistical error of 0.07 fb, corresponding to 4.6%, was achieved. 

Systematic errors have not yet been evaluated thoroughly. A first study of possible sys- 
tematics originating from the precision with which the beam energy spectrum can be mea- 
sured has shown that, due to the use of the variable Mc, the effect is negligible compared to 
the expected statistical errors. Larger effects are expected from detector and reconstruction 
uncertainties, such as the jet energy scale. 

6 Conclusions 

A 3 TeV e + e~ collider based on CLIC technology allows the measurement of TeV-scale 
light-flavored right-squarks through the decay into a quark and the lightest neutralino, the 
dominant decay channel if the decay into gluinos is forbidden. Using a combination of missing 
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energy and multivariate classifies it was possible to achieve a high signal significance despite 
standard-model background processes that exceed the signal production cross section by 
almost four orders of magnitude. The machine-related background from 77 — > hadrons 
processes could be controlled by timing cuts in the reconstruction and by a suitable choice 
of the jet finder. For full detector simulations of the CLIC CDR SUSY benchmark model 
with light-flavored right-squark masses of around fl25 GeV, a statistical precision of 5.9 
GeV, corresponding to 0.52%, is achieved for combined up- and down-type squarks with 
an integrated luminosity of 2ab _1 using a template fit with generator-level templates. For 
the same data sample, a statistical precision of 5% is achieved for the total production 
cross section, demonstrating that precision measurements of the properties new, strongly 
interacting particles are possible at CLIC in a rather generic new physics signature of two 
energetic jets and missing energy. 
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